(12) INTERNATIONAL AH»LICATION PUBLISHED UNDER T:HE PATENT COOPERATION TREATY (PCT) 



(19) Worid Intellectual Property Organization 

Internationa] Bureau 

(43) International Publication Date 
4 September 2003 (04.09.2003) 




PCT 



niDiiniiimHiiiiiiiiMii 



(10) International Publication Number 

wo 03/073731 Al 



(51) International Patent Classification'': H04M 1/64, 

3/00, H04L 12A)0, 12/66, G06F 13/00 

(21) International Application Number: PCT/US03/03129 

(22) International FiUng Date: 26 February 2003 (26.02.2003) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 
10/083.472 



27 February 2002 (27.02.2002) US 



(71) Applicant: SBC PROPERTIES, L.R [US/US]; 6500 
River Place Blvd., Building HI; 1st Floor, Austin, TX 
78736-1111 (US). 

(72) Inventors: PAROLKAR, Satish; 8807 Tweek Berwick 
Drive, Austin, TX 78750 (US). BHAVSAR, Vikram; 4445 
Lost Oasis Hollow, Austin, TX 78739 (US). 



(74) Agents: BERNSTEIN, Bruce, H. et al.; Greenblum & 
Bernstein, P.L.C., 1950 Roland Oaike Place, Reston, VA 
20191 (US), 

(81) Designated States (national): AE, AG, AL, AM, AT, AU. 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH. CN, CO. CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FT. GB, GD, GE, GH, 
GM, HR, HU, ID, XL. IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD. MG, MK, MN, MW, 
MX, MZ. NO. NZ, OM. PH, PL, FT, RO, RU, SC, SO. SE, 
SG, SK, SL, TJ, TM, TN, TR, TT. TZ. UA. UG. UZ, VC, 
VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM. 
KE, LS, MW. MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM. AZ, BY, KG, KZ, MD, RU, TJ, TM). 
European patent (AT. BE. BG, CH, CY. CZ, DE, DK, EE, 
ES, H, FR, GB, GR, HU, IE, IT, LU, MC, NL, FT. SE, SI, 
SK. TR). OAPI patent (BF, B J. CF, CG, CI, CM. GA, GN. 
GQ, GW. ML, MR, NE, SN, TD, TO). 

[Continued on next page] 



(54) Title: MULTI-MODAL COM3V1UNICATIONS METHOD 



USERKV2CE 

lAPTOPCoiytPirrER 

101 



USERI^VICE 
PDA 
102 



TEl£PHONE 
195 



o 
O 



USERDEVJCE 
PEN COMPUTER 
1(» 




(57) Abstract: Comunications between . 
an agent (5302) and a party (5301) 
using a conununications device . (102) 
involve sending an interactive script 
using session initiation protocol (SIP) 
to the communication device (102). The 
interactive script gathers information from 
the communications device or a user of 
the communication device (102) and the 
requested information is then forwarded 
to an agent terminal (170) using SIP. The 
interactive script is written in a high-level 
language and securely stored, such that 
an authorized party can easily access and 
alter the script to suit a need for particular 
information. 



150 
CAUL QUEUE 



0:^/073731 Al imWIIIIillllliliilllliliiliillMiiiiil 



Published: 

— with international search report 



For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations'* aj^aring at the begin- 
ning of each regular issue of the PCT Gazette. 



wo 03/073731 PCT/US03/03129 

MULTI-MODAL COMMUNICATIONS METHOD 

The present application expressly incorporates by reference herein the entire 
disclosure ofU.S.Applicatipn No. 10/083,472, entitled "Midli-modal Conmnmications 
Method", filed February 27, 2002. 

RAr:KGROTTNT> OF THF. TNVRNTION 

1. Field of the Invention 

[0001] The present invention relates to data collection. More particularly, the 
present invention relates to collecting information from a smart device or a user of a 
smart device by sending a message with an interactive script in a generic format to the 
smart device for processing. 

2. Background Information 

[0002] In a conventional call center environment, an mteractive voice response unit 
(TVR) may be used to collect information. Collected information may be used, for 
example, to determine the purpose of the call so that the call may be efficiently routed 
to an appropriate agent As an alternative, a human screener may be used to collect 
information ftom title caller and transfer the call to an appropriate agent for handling. 
[0003] An IVR may operate by sCTiding voice instmctions to the caller and receiving 
digital input signals in response from the caller. An IVR may also be provided with 
speech recognition software to recognize verbal responses from the caller. 
[0004] In the internet age, an intemet user may attempt to contact a human or 
programmed automated customer service agent operating through a web-site. The 
web-site may present the user vsdth a form in order for the user to mput information 
manually. In addition, a conventional web-site may be programmed to transmit a so- 
called "cookie", or applet, to a user's browser in order to facilitate future 
communications between the user and the web-site by collecting and storing 
information about the user. 

[0005] hi order to inq)rove efficiencies in the communications system, a next 
generation communications device has been developed that is programmable, i.e. 
"smart". A smart device enables a user to commxmicate via voice and/or text modes. 
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'A smart device is any device that contaiBS a micro-processor to process instructions 
. of a software program. Such smart devices contain internet browsing capability in 
addition to conventional transmission and reception of voice and data capabilities, 
order to navigate the internet eiflier via wireline or wireless networks. 
[0006] Currently, an interactive voice dialog may be established tiirou^ the inlmiet 
via a click-to-chat, ciick-to-talk mechanism, over conventional wireline or wireless 
telephones. As an example, in a contact center environment, a customer connected to 
the mtemet may click on a screen icon to reach the call center. When the customer 
clicks to contact Ae call center the call is first sent 1» a call queue where it waits for 
a call agent to become avaUable. Once an agent is available, the customer call is 
transferred firom the call queue to the call agent. 

[0007] Calls received via the cUck-to-talk mode are routed based on the URL at 
which the chck to talk button was pressed or a series of cookies that gathered the 
user's actions. The caUer is not asked for the intent of the call in advance, which can 
lead to delivering the call to an improper agent, resulting in unnecessary transfer of the 
caller between agents. 

[0008] Furthermore, a next generation communications device may be able to 
communicate using several differentmodes, either alternatively or simultaneo^^^^ 

example, on a single internet-enabled device, a caller may use any of tiie pubUc 
switched telephone network (PSTN), a wireless network, or a cable to attempt to 
contact an agent via analog or digital voice, or voice over mtemet protocol (VOIP). 
In addition, a caUer using tiie internet may attempt to contact an agent via a text 
interfece using an IP format A portable communications device may simultaneously 
support several modes of communication, such as presently available voice and text 
transmission formats, as well as other transmission formats as they are developed. 
[0009] A smart device will be able to run a program tiiat collects information so that 
information may be automatically collected for a requestor. To date, an integrated 
multi-modal method of collecting information has not been provided in a single 
system Moreover, a method is needed to transmit and receive questions and responses 
over a network in a common format that is independent of a particular communications 
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mode or format. AdditionaUy, a method is needed for automated data-coUection such 
that a questiomiaire can be easily modij&ed from any device connected to the network 
using a compatible modaUty. Accordingly, a system is needed to automate the 
collection of information from next generation smart devices and their users. 

JVRTKF DESCRTPinON OF THE DR AWINGS 
[00101 The present invention is further described in the detailed description that 
follows, by reference to the noted drawings by way of non-limiting examples of 
embodiments of the present invention, in which like reference numerals represent 
similar parts throughout several views of the drawings, and in which: 

FIG. 1 shows an exemplary communications network architecture for collecting 
information; 

FIG. 2 is an exemplary flow diagram showing a method of collecting 
information; and 

FIGS. 3A and 3B show an exemplary call flow diagram showing the various 
steps for sending and receiving an interactive script to be executed to a user device; 

T>F.TAn.ED nESCRIPTTON OF TH F. TTWENTION 
[00111 In view of the foregoing, the present invention, through one or more of its 
various aspects, embodiments and/or specific features or sub-components, is thus 
intended to bring out one or more of the advantages as specifically noted below. 
[0012] In one aspect of Ihe present invention, a method of collecting information 
from a communications device of a user for use by a recipient device is provided. The 
method includes establishing a communications connection between the 
communications device and the recipient device. An interactive text markup 
programming language script is sent to the communications device using a session 
initiation protocol (SIP) message. The script includes at least one query. In response 
to the query, an answer is received from the communications device. The answer is 
generated by the communications device and/or based upon an input by the user. 
[0013] In another aspect of the present invention, the method of collecting 
information includes sending a message to either a proxy server or a redirect server to 
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• establish the comnmnications connection between Ihe communications device and the 
recipient 

[00141 In yet another aspect of the present invention, the received answer includes 
information of a location of tiie communications device, a lype of the communications 
device, a communications format used by the communications device, a 
communications mode desired by the user of tiie communications device, a personal 
identification of a user of the communications device, an account number of a user of 
the communications device, a password of a user of the communications device, billing 
information of a user of the communications device, tiie intent of the user of the 
communications device, a preferred language of a user of the communications device, 
and/or a question from the user of the communications device. 
[00151 In another aspect of present invention, the received answer is a textual 
representation of either a DTMF tone, VoiceXML or HTML speech tags. In yet 
another aspect of tiie present invention, tiie mefliod of collecting information includes 
providing tiie answer to a user of tiie recipient device. In still anotiier aspect of tiie 
present invention, tiie interactive script includes a first query and a second query tiiat 
depends on tiie answer to the first query. In anotiier aspect of tiie present invention, 
tiie metiiod of collecting information includes tiranslating tiie answer to tiie interactive 
, script into text. 

[00161 According to a fiirflier aspect of tiie present invention, a metiiod of 
determining a final call destination for a caUer using a communications device is 
provided. The metiiod includes sendmg an interactive text markup programming 
language script, including at least one query, from a call queue to tiie communications 
device using a session initiation protocol (SIP) message. An aitssDopt is made to receive 
an answer to each query from the communications device witiiout requiring an action 
by tiie user of tiie communications device. For each query tiiat is not answered by the 
communications device, tiie query is presented to tiie caUer via a user interfece 
associated with flie communications device. When an answer(s) to tiie at least one 
query has been received from the cormnunications device or tiie caller, the script and 
tiie answers to tiie caU queue are retiimed. The call destination is determined based on 
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■ the answers to the at least one query. 

[00171 In another aspect of the present invention, the method of determming the 
final call destination includes sending a message to either a proxy server or a redirect 



server. 



[0018] In still another aspect of the present mvention, the received answer includes 
information of a location of the communications device, a type of the communications 
device, a communications fonnat used by tiie communications device, a 
communications mode desired by the user of tiie communications device, a personal 
identification of a user of the communications device, an account number of a user of 
the communications device, a password of auser of the communications device, billing 
information of a user of the communications device, the intent of tiie user of the 
communications device, a preferred language of auser of tiie communications device, 
and/or a question from the user of the communications device. 
[00191 In anotiier aspect of tiie present invention, tiie received answer is a texhial 
representation of one of a DTMF tone, VoiceXML and HTML speech tags. In yet 
anotiier aspect of tiie present invention, tiie answers to tiie query are provided to an 
agent at tiie final call destination. In stiU anotiier aspect of tiie present invention, flie 
interactive script includes a first query and a second query tiiat depends on tiie answer 
to tiie first query. In anotiier aspect of tiie present invention, tiie metiiod of 
determining tiie final call destination includes translating tiie answers to the interactive 
script into text 

[0020] In another aspect of the present invention, a method of interactively pre- 
screening caller information of a caller using a communications device is provided. 
The method includes estabhshing a communications connection between an 
information service and the communications device. An interactive text markup 
programming language script, inchiding at least one query, is sent from tiie information 
service to tiie communications device used by tiie caller, using a session initiation 
protocol (SIP) message. At least one answer to flie query is integrated into tiie 
interactive script, the answer being generated from tiie communications device and/or 
generated from input by tiie caller. The script, including tiie at least one answer, is 
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" received at the information service. 

[0021] In another aspect of the present invention, the method of interactively pre- 
screening caller information inchides establishing a second connection between the 
commnnications device and one of multiple agent devices, using SIP. The particular 
agent device is determined based on the at least one answer. In yet another aspect of 
the present invention, the method of interactively pre-screening caller information 
includes, sending a message to either a proxy server or a redirect server to establish the 
communications connection. 

[0022) In yet another aspect of the present invention, the received answer includes 
information of a location of the communications device, a type of the communications 
device, a communications format used by the communications device, a 
communications mode desired by the user of the communications device, a personal 
identification of a user of the communications device, an account number of a user of 
the communications device, apassword of auser of the communications device, billing 
information of a user of the communications device, the intent of the user of the 
communications device, a preferred language of a user of the communications device, 
and/or a question from the user of the communications device. 
[0023] Jn anotiier aspect of the present invention, the received answer is a textual 
representation of either a DTMF tone, VoiceXML or HTML speech tags. In still 
another aspect of the present invention, the method of interactively pre-screening caller 
information includes providing the answer to an agent of tiie mformation service at an 
agent terminal. In yet another aspect of the present invention, the at least one query 
of the interactive script includes a first query and a second query that is based upon a 
first answer to the first query. In another aspect of the present invention, the method 
of interactively pre-screening caller information includes translating the answers to tiie 
interactive script into text. 

[0024] hi a further aspect of the present invention, a con:q)uter readable medium is 
provided for storing a computer program that conti-ols collection of information firom 
a communications device of a user for receipt at a recipient device. The computer 
readable medium includes a session initiation protocol (SIP) segment tiiat creates a 
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• session initiation protocol message that is sent to the commumcations device. The SIP 
message includes an interactive text markup programming language script with at least 
one query. The computer readable medium also includes a data processing segment 
that receives and analyzes an answer from the communications device in response to 
the at least one query. The answer(s) are generated by the commumcations device 
and/or based upon an input by the user, 

[0025] In a ftirther aspect of the present invention, a data reception system that 
receives collected data from a communications device of a caller is provided. The data 
reception system includes an automated call distributor tiiat receives a call from a user 
device over an IP network. The data reception system also includes a call queue that 
receives the call from the automated call distributor and sends an interactive text 
markup programming language script, including at least one query, to the 
communications device of the caller using a session initiation protocol (SIP) message. 
A media gateway is provided tliat Iranslates the script into a format compatible with 
flie communications device of liie caUer when the commumcations device of tiie caller 
does not support the modaUty of the script as sent by the call queue. When an answer 
to the at least one query has been received from the commimications device or the 
caller, the answers are processed at the call queue. 

[0026] A communications system is provided for facilitating commuiucations 
between a user (client) of a communications device and a recipient, such as an 
information service, customer service agent or tiie like. The communications system 
and method of the present invention may also be utilized by any party having a use for 
caller information, such as a small business or even a private citizen in some cases. 
The communications system supports multiple connnunications modes, at least 
including voice and text, so that an agent may communicate with a user using any of 
the multiple modes. As used herein, the term "agent" should be considered in the 
context of an end recipient of a call, which may include electronic peripheral devices. 
[0027] Calls may use any of a multiple formats, including traditional voice calls 
over the plain old telephone system (POTS) or an advanced intelligent network (AJN), 
voice calls over a wireless network such as a third generation partnership project 
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■ (3GPP) network, voice calls iising the internet (VOIP), text messaging using any of the 
formats used for voice calls, or any other type of communications carried out over a 
network. Therefore, the terms "call" or "caller^' should be considered in the context 
of any type of interactive electronic communications betweoi two parties or electronic 
devices. 

[0028] In one embodiment, a call recipient is an agent associated with a call center 
including a call queiie and an automated call distributor (ACD). A caU queue is 
typically a server tbat tracks and interacts with incoming calls. The automated call 
distributor receives calls from a customer (user/client) and may connect the caU to a 
call queue, either automatically or after determining that all agents are unavailable. 
[0029] An exemplary network for user devices to contact an agent is shown in FIG. 
1. A user using an IP-compatible laptop computer 101, personal digital assistant 
(PDA) 102, pen conq)uter 103, or workstation 104 is connected to communications 
network 110. Communications network 1 10 is an IP networic that can be contacted by 
a user device by, for example, dial-up. User devices 101, 102, 103 and 104 may also 
be connected on a permanent or semi-permanent basis to tiie network 110 by 
connections such as cable hook-ups or wireless connections, e.g., a wireless network. 
[00301 In the embodiment of FIG. 1, tiie communications network 1 10 is connected 
to an automated caU distributor (ACD) 160. The automated call distoibutor 160 is 
connected to a call queue 150, such as a server. As noted herein, a caU queue 150 
tracks and communicates with calls after tiiey are received at a destination. The caU 
queue 150 may communicate witii calls to ensure tiiey are forwarded to agents in a 
determined order or priority, such as tiie order in which tiiey are received or a most 
appropriate agent as determined by an agent or agent/caller scoring system. When an 
appropriate agent is available, the call queue 150 is notified and mitiates a call tiansfer. 
The call transfer may include the initiation of anotiier call and tiie estabUshment of 
anotiier connection directly from tiie customer to tiie call agent tiirough tiie ACD 160, 
in which case tiie call queue 150 may wait for notification of a successfiil connection 
between tiie customer and tiie agent before removing tiie call session from tiie queue. 
A connection is estabUshed witii an agent at, for example, a workstation, such as 
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workstation 170, for processiag. 

[00311 In the case where the commmiicatioiis network 1 10 is the PSTN, a media 
gateway/proxy controUer 190 may be used as an intermediate proxy for the caller's 
device. As described herein, liie media gateway/proxy controller 190 may serve as a 
proxy or translator for a user with a conventional communications device 195, such as 
a conventional telephone. In the case of a user with a conventional telephone, the 
media gateway/proxy controller 190 can receive an interactive script and translate the 
script into voice response dialogue to be played to the caller. 

[00321 The information collection method of the present invention in tiie call center 
embodiment is shown in FIG. 2. At step S201, a call is received from a customer at 
the ACD 160. At step S202, the call is forwarded from the ACD 160 to the call queue 
150. 

[00331 In the call center embodiment, xtpon receipt of the call at the caU queue 150 
in step S202, the receiving apparatiis, in this case the call queue 150, compiles and 
forwards back to the user's device a session initiation protocol (SEP) MESSAGE with 
an interactive script in the message portion at step S203. SIP and SIP MESSAGES are 
described in detail below. The interactive script may be in a programming format that 
is recognizable by the user's device, such that tiie user's device processes the 
intCTactive script. As described below, in one embodiment tiie interactive script is in 
a text mark-up language, such as extensible markup language (XML). The script that 
is forwarded to the cUent device may be utilized by an applet embedded in a browser 
of the user's device to collect information from the user's device. The information 
may have been previously partially or folly collected and stored by tiie browser of tiie 
user's device in response to previous queries by eitiier tiie called party or any other 
party prompting tiie user's device to collect information, hi tiie altemative, tiie 
browser may collect tiie information from tiie chent device based only upon tiie newly 
received XML script. Additionally, when the media gateway/proxy contioller 190 or 
a similar device is necessary to process a script and translate script queries into voice 
signals for tiie caller, tiie infonnation collected from tiie caller may not have been 
previously stored at tiie user's device, such as when tiie user's device is a legacy 
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telephone handset. 

[0034] The interactive script may process information from the user's device or the 
\iser in a set order. For example, the interactive script may present individual questions 
to the user and then wait until a response has been provided by the user before 
proceeding to the next question. In the alternative, the interactive script may present 
a first question to a user and then decide a second question to present to the user based 
upon the user's reply to the first question. For example, if a user is first asked what 
language they speak, the subsequent questions may be presented in the language 
selected by the user. In addition, a user may be asked the purpose of their call, in 
which case the subsequent questions from the interactive script may depend on the 
answer provided by the user. The script may also seek to obtain information from the 
user's device automatically by collecting data that has been previously collected and 
stored at the user's device or a proxy or associated device, such as a database 
associated with the media gateway/proxy controller 190. The information may have 
been automatically collected by a script forwarded with an applet or the information 
may have been collected in previous communications involving a script that was sent 
to an applet 

[0035] Information may also be pre-programmed at the smart device by the user. 
For instance, a user's device naay be pre-programmed to indicate that tiie user speaks 
a particular language. Accordingly, when a SIP questionnaire requests information on 
a language used by the caller, the user's smart device automatically provides the 
answer without requiring input from the caller, thereby affecting the format of any 
subsequent queries presented to the user. Additionally, the smart device manufacturer 
may pre-program information such as a model number, date of purchase, technical 
specifications including modalities supported by the device, memory size and 
processing speed. In addition, third party service providers may provide information 
for automatic collection, such as billing information, credit verification and 
authorization and security protocols. In an embodiment, the information collected 
using the SIP questionnaire wiH be automatically collected at the user's device without 
requiring responsive input by the caller at the time of the call. 
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' [0036] A SIP MESSAGE with a text markup language script and replies is received 
at the automated call distributor 160, and forwarded to the call queue 150 in the call 

center embodiment, at step S204. The answers of iiie script are processed at step S205. 
The answers may be utilized to determine a proper agent to handle the call, such as in 
the case where a caller speaks a foreign language. In addition, answers may be 
presented in text format to the agent that receives the call to expedite processing of the 
call. Answers may also be stored in a database associated with the call center, in 
which case the answers may be compiled and statistically analyzed to enhance call 
center operation. 

[0037] In the embodiment where the agent is associated witii a call center, the 
automated call distributor 160 will connect the call to a call queue 150 where it waits 
for the next available agent Accordingly, the SIP messaging occurs while the call is 
waiting at the call queue 150. As agents become available they will notify the call 
queue 150 that they are available to take a new call. If there is a call waiting, then the 
caiU queue 150 responds by informing the agent that there is a call in the call queue 1 50 
and then initiating a call transfer from the call queue 150 to the available agent As a 
result, a call is established between the customer and an agent The customer device 
notifies the call queue 150 that the transfer was successful and the call queue 150 
removes the call from the call queue 150. 

[0038] SIP Messages are used in communications involving an IP-enabled network. 
The SIP •'MESSAGE & NOTIFY" is used to initiate a call to visit an address, such as 
when a user types in a web-site's URL. The "caU" involving SIP is to be distinguished 
from an initial call in which the user contacts the network, such as a dial-up session. 
However, it should be noted that SIP is used when a user of a smart device connected 
to the network wishes to utilize voice communication capabilities to perform a 
secondary call, such as by clicking on a "Click to Talk" button on a web screen to call 
another party , using the web-site providing the web screen as an intermediary. 
[0039] SIP provides necessary protocols for end and intermediate systems to use in 
providing services. Services can be provided for sessions including, but not Hmited 
to, iatemet conferencing, telephony and instant messaging sessions. SIP can also be 
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used to initiate calls between legacy-type telephone devices, if the necessary switches 
are suitably equipped. However, it should be noted that a legacy-type telephone device 
that is not IP-compatible would not necessarily have a particular use for SIP except in 
cooperation with additional network elements such as the media gateway/proxy 
controller 190, or a peripheral device performing a similar function. Services that can 
be provided using SIP protocols include IVR services, call forwarding, number 
deUvery/caller ID, personal mobility, terminal-type negotiation and selection, caller 
authentication, call transfer and invitations to multicast conferraices. In cases such as 
call forwarding or authenticating caller ID, a proxy server may also negotiate the call 
using Sn*. Furthermore, a proxy server may negotiate the call, regardless of the 
service provided, in some session types, including click-to-talk. Accordingly, SIP 
messages may be routed using proxy or redirect servers. In addition, SEP messages 
may be transmitted over existing IP-enablmg networks having conventional or 
advanced intelligent network elements inchiding service control points (SCPs), service 
switching pomts (SSPs) and lookup databases, such as address lookup databases. 
[0040] In the present invention, a SIP MESSAGE session is utilized by sending an 
interactive script to the user's device in the message area of a SIP message. The 
interactive script may be a programming script in an XML format In any case, 
proprietary defined SIP message headers are avoided so that a standard SIP message 
may also be used in a conventional manner without confusion in addition to the 
interactive script messaging. 

[0041] The message sent in the SIP MESSAGE session may include a questionnaire 
for use with an embedded program, such as an applet, in the client device. In the 
altemative, the message may comprise a complete self-contained program including 
an applet and questionnaire format 

[0042] An exemplary call flow using session initiation protocol messages is shown 
in FIG.s 3A and 3B. FIG.s 3A and SB show an exemplary call flow diagram that 
illustrates a sequence of operations between the various components of the system in 
the call center embodiment of the present invaition when a cUent device attempts to 
contact an agent No attempt has been made to show every possible operation in a call 
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■ flow sequence. Rather, details have been provided to e3q)lain the sequence of 
operations in a call to the exemplary call center that includes sending and receiving a 
SIP MESSAGE. 

[0043] A call flow sequence starts when a user has input enough information to a 
device to initiate a calL At step S301, the user's device sends an INVITE message to 
the automated caU distributor 160 which relays the INVITE message to a call queue 
150 at step S302. The INVITE message is an invitation to the call center to engage in 
a session. At step 8303, upon receipt of the INVITE message, the call queue 150 
indicates RINGING to the automated call distributor 160 which relays the RINGING 
indication to the client device at step S304. The RINGING message could result in the 
caller hearing a ring tone or seeing a text message indicator equivalent to a ring tone, 
such as "attempting call - no answer yet". The call queue 150 also sends an OK 
message to the client via the automated call distributor 160 to acknowledge receipt of 
the INVITE message at step S306. At step S307, the client sends an acknowledge 
(ACK) message to the ACD server, which relays the acknowledge message to the call 
queue 150. The acknowledgment message is sent to acknowledge that the call has 
been answered and a session has been established. 

[0044] Once the call is in progress with the client and call queue 1 50, i.e. once the 
call is answered at the call queue 150, a SIP MESSAGE is sent to the client with a 
markup language, such as XML, questionnaire to collect information ^rom the 
customer at step S320. The questionnaire may include a complete data request for use 
in the case when the user has not previously contacted the call center. In the 
alternative, in the case that a questionnaire has been previously submitted to the chent 
device, or if the cUent device has a compatible applet that is embedded with available 
information aheady collected, the questionnaure may include an update data request for 
the applet to use in collecting information from the caller. 

[0045] The SIP MESSAGE is sent from the call queue 150 to the automated call 
distributor 160 which relays the SIP MESSAGE to the client. The client sends a new 
OK message to the call queue 150 via the automated call distributor 160 at step S321. 
[0046] After the client application gathers the information based on the 
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' questionnaire from the customer, tiie client sends a SIP MESSAGE back to the call 
queue 150 at step S330 with the questionnaiie, and including the customer information. 
A SIP MESSAGE is sent from the client to the automated call distributor 160 which 
relays the SIP MESSAGE to the call queue 150. An OK message is sent from the call 
queue 150 to the client via the automated caU distributor 160 at step S331. 
[0047] The call queue 150 may then process the information received in the SIP 
MESSAGE from the questionnaire and determine a proper agent to handle the 
customer's call. At step S340 when an agent is available, the agent sends a SB? 
NOTIFY to the ACD server 160 which forwards a SIP NOTIFY to the call queue 150. 
[0048] If the call queue 150 determines that the agent responsible for sending the 
SIP NOTIFY can handle the call, a SIP 200 OK message is sent to the ACD server at 
step S341. The SIP 200 OK message is relayed to die agent from the ACD server 160 
along with the customer information collected from the customer using fiie text markup 
language script at step S342. 

[0049] To complete the call connection between the client and the agent, the call 
queue 150 initiates a caU transfer from the caU queue 150 to the available agent at step 
S350. The call queue 150 sends a REFER message to the ACD server which relays 
the REFER message to the cUent The client sends an OK message to the ACD server 
160 which, in turn, sends an OK message to the call queue 150 at step S352. The 
client application initiates an invitation to the agent by sending an INVITE message 
to the agent via the ACD server 160 at step S354. The agent application anticipates 
the call, having previously sent the NOTEFT message to notify the call queue 150 of 
the agent's availabihty at step S340. The agent responds to the INVITE message in 
step S354 by sending a RINGING message to the client via the ACD server at step 
S356. The agent also sends an OK message to the client via the ACD server 160 at 
step S358. At step S360, the cUent application notifies the caU queue 150, via ACD 
server 160, once the call has successfully been transferred. The call queue 150 sends 
an OK message back to the chent, via the ACD server 1 60, at step S362. As a result, 
the call is estabUshed between the cUent and the ag^t An acknowledgment (ACK) 
message is sent from the chent to the ACD server 160 which, in turn, sends an 
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' acknowledgment message to the agent at step S364. The acknowledgment messages 
conclude flie relevant SIP messaging for establishing the call between the client and 
the agent At step S366, the exemplary call is conducted between the cUent and the 
agent a using a real-time transport protocol (RTP) and/or real time conferencing 
protocol (RTCP), 

[0050] Li the caD center embodiment, in addition to utilizing the collected 
information to route the call to a proper agent, an additional benefit is realized when 
the collected information includes data that can be given to the agent at the onset of the 
call, thus better equipping the agent to handle the call. For example, using the data 
collection method disclosed herein, an agent may obtain information regarding the 
nature of the call (e.g., the call relates to the caller hearing a "buzzing" on the line) at 
virtually the same time as the caller is connected to the agent The call cemtsr agent is 
thus better equipped to handle the call with the information presented at the outset of 
the call. In the embodiment where the data is automatically collected at the client 
device, information may be available for the agent even when the call is not delayed 
at the call queue. Additionally, when the call is delayed at the call queue, a caller may 
be requested to input responses to queries from the script. 

[0051] The conventional SIP MESSAGE is sent as plain/text data and can be used 
for instant messaging. However, the method and apparatus of the instant invention use 
the SIP MESSAGE in a xmique manner by sending software instructions, such as XML 
language instructions to be executed at a user^s smart device. The rales of an XML 
format of the instant invention may follow the exemplary document type definition 
(DTD) as follows: 

<!ELEMENT questionnaire (question+)> 
<!ELEMENT question (question__text, answer__list*)> 
<! ATTLIST question answer_type (text hist IyN)> 
<!ATTLIST question question_desc (fflPCDATA)> 
<!ELEMENT text EMPTY> 
<!ELEMENT hst EMPTY> 
<!ELEMENT YN (Yes INo)> 
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iLEMENT answer Jist (#PCDATA)> 
ATTLIST answer^list ID #REQUIRED> 

052] Following is an exemplary questionnaire (script) for a client application to 

ie to collect customea: information using an interactive XML script in a SIP 

ESSAGE: 

juestiomiaire> 

<question answer_type='text' question_desc='City'> 
<question_text> 

What city are you calling from? 
</question_text> 
</question> 

<question answer_type='list' question_desc='Priority'> 
<question_text> 

How lurgent is the problem for your call? 
</question_text> 
<answer_Iist> 

<aiiswer_list_answer id='l'> 

Emergency 
</answer_hst_answer> 
<answer_list_answer id=*2'> 

Resolve as soon as possible 
</answer_list_answer> 
<answer_list_answer id='3'> 

Not immediate problem 
</answer_list__answer> 
<answer__list_answer id- 4*> 
N/A 

</answer_list_answer> 
</answer_list> 
</question> 

<question answer_1ype='YN' question_desc="Under Warranty'> 
<question_text> 

Is the product you are calling about still under warranty? 
</question_text> 
<7question> 
'questionnaire> 

>053] The use of a widely available, high level language, such as XML, aUows the 
'Stem to be dynamic. As a result, the interactive script can be built or revised by an 
vner, or authorized representative of the owner, at virtually any time from any 
cation provided with access to a communications network supporting one of the 
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modes used by the data collection system. For example, an internet web page may be 
provided that will allow an authorized user to customize tiie script as the user desires. 
[0054] According to the present invention, a script manager may use a manager 
device, such as ^top, scanner or desktop to access the stored script in order to modify 
the interactive script Accordingly, a manager may modify the script from virtually 
anywhere in the world at any time if the manager can access an IP compatible 
communications network, includiug wireless and the PSTN. Instructions for 
modifications can be translated into text by any known translation method that 
translates, for example, voice instructions or handwriting into a text format for entry 
at the script storage. In an CTubodiment, the script manager will be versed in XML and 
will provide the instructions directly to flie script storage, after satisfying any necessary 
security procedures to access the storage. 

[0055] As shown in FIGS. 3A and 3B and discussed above, in a call center 
embodiment, the script is forwarded when a call is established at the call queue 150. 
The script is sent as part of an SIP MESSAGE to the client. The client responds with 
a SIP OK message indicating that it received the message from the call queue 150. 
The client application then processes the script. The smart device may be provided 
with voice capability, including voice recognition, so that a caller may be prompted 
vocally to provide information responsive to the script when the text markup language 
script is translated into voice Signals. The smart device may be provided with text 
c^ability, including character or handwriting recognition, so that a caller may be 
visually prompted with a text message to provide information responsive to the script. 
The caller may be prompted to speak, type or write a response if the information 
cannot be automatically collected at tiie applet Moreover, similar to the conventional 
applet that is provided for intemet use, the smart device may be provided with the 
capability to automatically provide information responsive to the script 
[0056] The smart device may ftirther support translation of speech or audio tones 
into text. Accordingly, a user may provide voice or digital input responsive to queries 
generated using the script For example, the smart device may translate a DTMF 
signal, such as a digitally input DTMF signal from a keypad, into a textual 
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representation of the keypad or keyboard character tiiat was pressed by the user. 
AdditionaUy, the smart device may support VoiceXML and HTML speech tags that 
represent textual translation of a user's voice input responsive to a script query. The 
textual representation of the DTMF, VoiceXML, HTML or other audio-to-textual 
format, may be forwarded with flie script for use of Ae party that sent the script to the 



user. 



[0057] The script is updated with the answers to the script queries. When the script 
queries are exhausted, the client appUcation sends a SIP MESSAGE to the call queue 
150 with the questionnaire and the responsive information. The call queue 150 may 
use the received answers, e.g., to determine the call agent best enable of handling the 
caU. Furthermore, the call queue 150 may store the data in, for example, a database, 
so that the information can be transferred to an agent when an appropriate agent is 
available. 

[0058] An exemplary script question is pres«ited below: 

MESSAGE sip:customer@clientcom SlP/2.0 

Via: SIP/2.0/IJDP ACDServer.com 

Via: SIP/2.0/Ur)P CallQueue.com 

From: sip:SBC@CallQueue.com 

To: sip:customer@client.com 

Call-ID: SBCtoCustomerCall_12345 

Cseq: 1 MESSAGE 

Content-Type: text/xml 

Content-Length: 187 

<questionnaire> 

<question answer_typ^'text' question_desc='City'> 

<question_text> 

What city are you calling form? 

<7question_text> 
</question> 
</questionnaire> 
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[0059] An exemplary message, responsive to the above-described question, appears 
below: 

MESSAGE sip:SBC@CallQueue.com SIP/2.0 

Via: SIP/2.0AJbP ACbServer.com 

Via: SIP/2.0/UDP CUent.com 

From: sip:customer@Clientcom 

To: sip:SBC@CallQueue.com 

Call-ID: SBCtoCutomerCall_12345 

Cseq: 1 MESSAGE 

Content-Type: text/xml 

Content-Length: 215 

<questionnaire> 

<question answer_type='text' question_desc=<'City'> 
<questioii_texe> 

What city are you calling from? 
<yquestion_text> 
<answer> 

Austin 
</answer> 
</question> 
</questioimaire> 

[0060] Another embodiment of the present invention allows a user of a smart device 
to contact a provider of self-installiQg content, such as a software program. The 
provider may send a SIP message with the interactive script to determine the capabiUty 
of the user's smart device. For example, the script may verify that sufficient memory 
exists at the smart device to download a requested program. Additionally, the script 
may verify that the program is supported by the smart device, as in the case of a user 
with a conventional computer terminal requesting a large program that provides audio 
when the conventional computer terminal is not equipped with speakers or an audio 
output capabihty . Accordingly, use of the interactive script may help a user of a smart 
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" device avoid ordering an item that is not supported by the user's device. The invention 
may further enable a user to automatically provide information from peripheral 
devices, or even household devices, if the user's smart device is networked to other 
smart appliances, such as a refrigerator or heating/air-conditiomng system via home 
networking equipment. Accordingly, accurate technical and diagnostic information 
may be provided using the SIP messages to collect information in order to efficiently 
and remotely provide infomiation to a service provider and to enable collection of 
technical and diagnostic infomiation. 

[0061] Another embodiment of the present invention involving a call from one 
private citizen to another private citizen could use a similar structure to that shown in 
FIG L For example, a caller may use an IP-enabled mobile device to call a second 
party equipped with at least a server and another IP-enabled cormnunications device. 
For exanq)le, a user/caller using a PDA 102 could be a child or teenager instructed to 
call home. The PDA 102 may be equipped with a global positioning system (GPS) 
receiver. In an embodiment, the second party's IP-enabled device may be equipped 
with conventionally available mapping software that provides U.S. street maps 
corresponding to GPS data. Accordingly, a teenager (first party) calling home using 
the PDA may automatically receive an inquiry from their home system (second party) 
to verify their location. The inquiry may be in the form of a complete applet with a 
questionnaire, or merely a questionnaire to be processed by an embedded applet As 
in the first embodiment, the questionnaire with appropriate instractions is transmitted 
in a SIP MESSAGE. The return SEP MESSAGE with the answers to the queries, 
including GPS data in any character format recognized by the mapping software, may 
be received at tiie second party's IP-enabled device and automatically translated for use 
of the mapping software. Accordingly, in an embodiment using SIP messaging to 
inquire of a calling party's device, a parent may verify a child's location, almost in real 
time, and display the location on a map. 

[0062] In another embodiment, a security service may call a smart device to request 
information regarding a home security systan. The user of the smart device may be 
prompted to input information in response to a message containing an interactive script 
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from the security service. For example, a party may be prompted to input authorization 
information before being connected to an agent wishing to personally verify that an 
alarm has been improperly activated at the called party's residence. A security system 
or other service may additionally request a caller to remotely program a security 
system, such as system activation and de-activation times using the script 
[0063] As is illustrated, the present invention can be used to obtain almost any 
maimer of caller data that can be provided at a next generation communications device 
and translated into a format for communication over a network. Working v^ith 
translation software, the present invention can enable a called party to automatically 
query a caller's device or a caller in a manner and order programmed by the script 
nianager. 

[0064] Although the invention has been described with reference to several 
exemplary embodiments, it is imderstood that the words that have been used are words 
of description and illustration, rather than words of limitation. Changes may be made 
within the purview of the appended claims, as presently stated and as amended, 
v^fhout departing from the scope and spirit of the invention in its aspects. Alfhou^ 
the invention has been described with reference to particular means, materials and 
embodiments, the invention is not intended to be limited to the particulars disclosed; 
rather, the invention extends to aU functionally equivalent structures, methods and uses 
such as are within the scope of the appended claims. 

[0065] In accordance with various embodiments of the present invention, the 
methods described herein are intended for operation as software programs running on 
a computer processor. Dedicated hardware implementations including, but not limited 
to, application specific integrated circuits, programmable logic arrays and other 
hardware devices can likewise be constructed to implement the methods described 
herein. Furthermore, alternative software implementations iacluding, but not limited 
to, distributed processing or component/object distributed processing, parallel 
processing, or virtual machine processing can also be constructed to implement the 
methods described herein. 

[0066] It should also be noted that the software implementations of the present 
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invention as described herein are optionally stored on a tangible storage medium, such 
as: a magnetic medium such as a disk or tape; a magneto-optical or optical medium 
such as a disk; or a solid state medium such as a memory card or other package that 
houses one or more read-only (non-volatile) memories, random access memories, or 
other re-writable (volatile) memories. A digital file attachment to email or other self- 
contained information archive or set of archives is considered a distribution medium 
equivalent to a tangible storage mediiun. Accordingly, the invention is considered to 
include a tangible storage medium or distribution medium, as listed herein and 
including art-recognized equivalents and successor media, in which the software 
inaplementations herein are stored. 

[0067] Although the present specification describes components and fimctions 
implemented in the embodimaits with referraice to particular standards and protocols, 
the invention is not limited to such standards and protocols. Each of the standards for 
Intemet and other packet switched network transmission pubUc telephone networks 
represent examples of the state of the art. Such standards are periodically superseded 
by faster or more ejBBcient equivalents having essentially the same fimctions. 
Accordingly, replacement standards and protocols having the same fimctions are 
considered equivalents. 
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' What is claimed is: 

1 . A method of collecting information from a communications device of a user for 
use by a recipient device, the method comprising: 

estabUshiQg a coromunications connection between the communications device 
and the recipient device; 

sending an interactive text markiq) programming language script to the 
commnnications device usiag a session initiation protocol (SIP) message, the script 
comprising at least one query; and 

receiving an answer from the connnunications device in response to the at least 
one query, the answer being at least one of generated by the communications device 
and based upon an input by tiie user. 

2. The method of collecting information of claim 1 , further comprising: 
sending a message to one of a proxy server and a redirect server to estabUsh the 

cominunications connection between the communications device and the recipient. 

3. The method of collecting information of claim 1, wherein the received answer 
comprises information of at least one of a location of the communications device, a 
type of the communications device, a communications format used by the 
communications device, a communications mode desired by the user of the 
communications device, a personal identification of a user of the conmnmications 
device, an account number of a user of tiie commimications device, a password of a 
user of the communications device, billing information of a user of the 
communications device, the intent of the user of the communications device, a 
preferred language of a viser of the communications device, and a question from the 
user of the communications device. 

4. The method of coDecting information of claim 1 , wherein the received answer 
is a textual representation of one of a DTMF tone, VoiceXML and HTML speech tags. 

5 . The method of collecting information of claim 1 , further comprising providing 
the answer to a user of the recipient device. 



23 



wo 03/073731 PCT/US03/03129 

6. The method of coUectmg information of claim 1 , the at least one query of the 
interactive script comprising a first query and a second query that depends on the 
answer to the first query. 

7. The method of coUecting information of claim 1, further comprising taidalipg 
the answer to the interactive script into text 

8. A method of determining a final call destination for a caller using a 
conmnmications device, the method conqjrising: 

sending an interactive text markup programming language script, including at 
least one query, from a caU queue to the communications device using a session 
initiation protocol (SEP) message; 

attenq)ting to receive an answer to each query from the communications device 
without requiring an action by the user of tite communications device; 

for each query that is not answered by the communications device, presenting 
the queiy to the caller via a user interfece associated with the communications device; 

when an answer to the at least one query has been received from one of liie 
communications device and the caUer, returning the script and the answers to the call 
queue; and 

determining the call destination based on tiie answers to the at least one query. 

9. The method of determining the final call destination of claim 8, further 
conq)rising sending a liiessage to one of a proxy server and a redirect server. 

1 0. The method of determining the final call destination of claim 8, wherein the 
received answer comprises information of at least one of a location of the 
communications device, a type of the communications device, a communications 
format used by the comrauiiications device, a communications mode desired by the 
user of the communications device, a personal identification of a user of the 
communications device, an account number of a user of the communications device, 
a password of a user of the communications device, billing information of a user of flie 
communications device, the intent of the user of the communications device, a 
preferred language of a user of the communications device, and a question from the 
user of the communications device. 
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11. The method of detennining the final caU destination of claim 8, wherein the 
received answer is a textual representation of one of a DTMF tone, VoiceXML and 
HTML speech tags. 

12. The method of determining the final call destination of claim 8, fiirther 
comprising providing the answers to an agent at the final call destination. 

13 . The method of detennining the final call destination of claim 8, the at least one 
query of tite interactive script comprising a first query and a second query that depends 
on the answer to the first query. 

14. The method of determining the final call destination of claim 8, fiirther 
comprising translating the answers to the interactive script into text. 

15. A method of interactively pre-screening caller informatiori of a caller using a 
communications device, the method comprising: 

establishing a communications connection between an information service and 
the communications device; 

sending an interactive text markup programming language script, including at 
least one query, firom the information service to the commimications device used by 
the caller, using a session initiation protocol (SIP) message; 

integrating into the iateractive script at least one answer to the query, the answer 
being at least one of generated firom the communications device and generated firom 
input by the caller; 

receiving the script, including the at least one answer, at the information 
service. 

16. The method of interactively pre-screening caller information of claim 15, 
fiirther comprising establishing a second connection between the communications 
device and one of a plurality of agent devices using SIP, the one of the plurality of 
agent devices being determined based on the at least one answer, 

17. The method of interactively pre-screening caller information of claim 15, 
whereiQ the establishing a communications connection fiirther comprises sending a 
message to one of a proxy server and a redirect server. 
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18. The method of interactively pre-screening caller informatioii of claim 15, 
wherein the received answer comprises infonnation of at least one of a location of the 
communications device, a type of the commtmications device, a commnnications 
format used hy the conmiunications device, a communications mode desired by the 
user of the communications device, a personal identification of a user of tiie 
communications device, an account number of a user of the communications device, 
a password of a user of the conmiunications device, billing infonnation of a user of the 
communications device, the intent of the user of the communications device, a 
preferred language of a user of the communications device, and a question fi-om the 
user of the communications device. 

19. The method of interactively pre-screening caller information of claim 15, 
wherein the received answer is a textual representation of one of a DTMF tone, 
VoiceXML and HTML speech tags. 

20. The method of interactively pre-screening caller information of claim 15, 
further comprising providing the answer to an agent of the information service at an 
agent tenrdnal. 

2 1 . The method of interactively pre-screening caller information of claim 1 5 , the 
at least one query of the interactive script comprising a first query and a second query 
that is based upon a first answer to the first query. 

22. The method of interactively pre-screeniug caller information of claim 15, 
fiirther comprising translating the answers to the interactive script into text 

23. A computer readable medium for storing a computer program that controls 
collection of information firom a communications device of a user for receipt at a 
recipient device, the computer readable medium comprising: 

a session initiation protocol (SIP) segment that creates a session initiation 
protocol message that is sent to the commimications device, the SIP message 
comprising an interactive text markup programming language script that comprises at 
least one query; 
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a data processing segment that receives and analyzes an answer from the 
commimications device in response to the at least one query, , the answer being at least 
one of generated by the communications device and based upon an input by the user. 
24. An data reception system that receives collected data from a communications 
device of a caller, comprising: 

an automated caU distributor that receives a call from a user device over an IP 
network; 

a call queue that receives the call from the automated call distributor and sends 
an interactive text markup programming language script, including at least one query, 
to the communications device of the caller using a session initiation protocol (SIP) 
message; 

a media gateway that translates the script into a format compatible with the 
commimications device of the caller when the commimications device of the caller 
does not support the modality of the script as sent by the call queue; 

wherein when an answer to the at least one query has been received from one 
of the communications device and the caller, the answers are processed at the call 
queue. 
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